Using Label Propagation for Learning Temporally Abstract Actions in Reinforcement Learning
نویسندگان
چکیده
Temporal abstraction plays a key role in scaling up reinforcement learning algorithms. While learning and planning with given temporally extended actions has been well studied, the topic of how to construct this type of abstraction automatically from data is still open. We propose to use the label propagation algorithm for community detection in order to construct extended actions, within the framework of options. We illustrate the benefit of the approach in small computational experiments and discuss its relationship to existing methods for subgoal discovery.
منابع مشابه
Tree Based Hierarchical Reinforcement Learning
In this thesis we investigate methods for speeding up automatic control algorithms. Specifically, we provide new abstraction techniques for Reinforcement Learning and Semi-Markov Decision Processes (SMDPs). We introduce the use of policies as temporally abstract actions. This is different from previous definitions of temporally abstract actions as we do not have termination criteria. We provide...
متن کاملMacro - Actions in Reinforcement Learning : An EmpiricalAnalysisAmy McGovern and Richard
Several researchers have proposed reinforcement learning methods that obtain advantages in learning by using temporally extended actions, or macro-actions, but none has carefully analyzed what these advantages are. In this paper, we separate and analyze two advantages of using macro-actions in reinforcement learning: the eeect on exploratory behavior, independent of learning, and the eeect on t...
متن کاملMacro Actions in Reinforcement Learning An Empirical Analysis
Several researchers have proposed reinforcement learning methods that obtain ad vantages in learning by using temporally extended actions or macro actions but none has carefully analyzed what these advantages are In this paper we separate and an alyze two advantages of using macro actions in reinforcement learning the e ect on exploratory behavior independent of learning and the e ect on the sp...
متن کاملTheoretical Results on Reinforcement Learning with Temporally Abstract Options
We present new theoretical results on planning within the framework of temporally abstract reinforcement learning (Precup & Sutton, 1997; Sutton, 1995). Temporal abstraction is a key step in any decision making system that involves planning and prediction. In temporally abstract reinforcement learning, the agent is allowed to choose among ”options”, whole courses of action that may be temporall...
متن کاملMulti-time Models for Temporally Abstract Planning
Planning and learning at multiple levels of temporal abstraction is a key problem for artificial intelligence. In this paper we summarize an approach to this problem based on the mathematical framework of Markov decision processes and reinforcement learning. Current model-based reinforcement learning is based on one-step models that cannot represent common-sense higher-level actions, such as go...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013